Yuheng Zhang

TagSpeech: End-to-End Multi-Speaker ASR and Diarization with Fine-Grained Temporal Grounding

Jan 11, 2026
Via arXiv

RoboMIND 2.0: A Multimodal, Bimanual Mobile Manipulation Dataset for Generalizable Embodied Intelligence

Dec 31, 2025
Via arXiv

Improving Deepfake Detection with Reinforcement Learning-Based Adaptive Data Augmentation

Nov 10, 2025
Via arXiv

Identifying and Calibrating Overconfidence in Noisy Speech Recognition

Sep 08, 2025
Via arXiv

ArtVIP: Articulated Digital Assets of Visual Realism, Modular Interaction, and Physical Fidelity for Robot Learning

Jun 06, 2025
Via arXiv

CryoCCD: Conditional Cycle-consistent Diffusion with Biophysical Modeling for Cryo-EM Synthesis

May 29, 2025
Via arXiv

Panoramic Out-of-Distribution Segmentation

May 06, 2025
Via arXiv

Statistical Tractability of Off-policy Evaluation of History-dependent Policies in POMDPs

Mar 03, 2025
Via arXiv

Improving LLM General Preference Alignment via Optimistic Online Mirror Descent

Feb 24, 2025
Via arXiv

Teaching LLMs to Refine with Tools

Dec 22, 2024
Via arXiv